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Capacity Regions and Sum-Rate Capacities of 
Vector Gaussian Interference Channels 



Xiaohu Shang, Biao Chen, Gerhard Kramer and H. Vincent Poor 



Abstract 

The capacity regions of vector, or multiple-input multiple-output, Gaussian interference channels 
are established for very strong interference and aligned strong interference. Furthermore, the sum-rate 
capacities are established for Z interference, noisy interference, and mixed (aligned weak/intermediate 
and aligned strong) interference. These results generalize known results for scalar Gaussian interference 
channels. 

I. Introduction 

The interference channel (IC) models the situation in which transmitters communicate with their 
respective receivers while generating interference to all other receivers. This channel model was mentioned 
in [1, Section 14] and its capacity region is still generally unknown. 

In [2] Carleial showed that interference does not reduce capacity when it is very strong. This result 
follows because the interference can be decoded and subtracted at each receiver before decoding the 
desired message. Later Han and Kobayashi [3] and Sato [4] showed that the capacity region of the strong 
interference channel is the same as the capacity region of a compound multiple access channel. In these 
cases, the interference is fully decoded at both receivers. 
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Fig. 1. The two-user MIMO IC. 

When the interference is not strong, the capacity region is unknown. The best inner bound is by Han 
and Kobayashi [3], which was later simplified by Chong et al. in [5] and [6]. Etkin et al. and Telatar 
and Tse showed that Han and Kobayashi 's inner bound is within one bit of the capacity region for scalar 
Gaussian ICs [7] and [8]. Various outer bounds have been developed in [7]-[12]. 

Special ICs such as the degraded IC and the Z interference channel (ZIC) were studied in [13] and [14]. 
Costa proved that the capacity regions of degraded ICs and ZICs are the same for the scalar Gaussian 
case [14]. The sum-rate capacity for the ZIC was established in [13] and [15]. A recent result in [10]— [12] 
showed that if a two-user Gaussian scalar IC has noisy interference, then treating interference as noise 
can achieve the sum-rate capacity. This result has been extended to multi-user Gaussian ICs in [16] and 
[12]. The sum-rate capacity for mixed interference, i.e., one receiver has strong interference and the other 
has weak/intermediate interference, was derived in [11] and [17]. 

In this paper, we study the capacity of the two-user Gaussian vector IC or multiple-input multiple-output 
(MIMO) IC. As shown in Fig. 1, the received signals are defined as 



where the transmitted (column) vector signal of user i which is subject to the average 

covariance matrix constraint 



where Xn,Xi2, . . . ,Xi n , is the transmitted vector sequence of user i, and Sj is a fixed positive semi- 
definite matrix. Inequality A ^ B means that A — B is Hermitian positive semi-definite. The noise z- L is 
a circularly symmetric complex Gaussian random vector with zero mean and identity covariance matrix; 



y 1 = HiXi + H2X2 + zi and 



y 2 = H 3 xi +U 4 x A + Z2, 



(1) 



n 




(2) 
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and Hfc, j = 1, ... ,4, are the complex channel matrices known at both the transmitters and receivers. 
Transmitter i has ti antennas and receiver i has ri antennas. 

For the MIMO IC, Telatar and Tse [8] showed that Han and Kobayashi's region is within one bit 
per receive antenna of the capacity region. Some outer bounds for the capacity region were discussed 
in [18] and some lower bounds for the sum-rate capacity based on Han and Kobayashi's region were 
given in [19]. Recent work in [20] and [21] extended the existing capacity results from scalar ICs to 
MIMO ICs under average power constraints. Specifically, [20] and [21] derived the capacity region for 
aligned strong interference, and the sum-rate capacity for Z interference, noisy interference and mixed 
interference under average power constraints. It should be noted that some of the results in [20] and [21] 
require the channel matrices to be square and invertible, and the noisy-interference sum-rate capacity is 
obtained by requiring all possible covariance matrices of X\ and X2 to satisfy a condition. A partially 
strengthened noisy interference condition for MIMO ICs was later presented in [22] which required only 
that the optimizing covariance matrices of x\ and x^ satisfy the condition of [20] and [21], as long as these 
optimizing covariance matrices have full rank (see [22, Remarks 2 and 3 and Theorem 1]). A special case 
of the MIMO IC, the so-called parallel Gaussian IC where the Hj's are all square and diagonal matrices, 
was studied in [23] and [24], and it was shown that under suitable conditions for channel matrices 
and the power constraints, separate coding among antennas (or the transmit vector entries) and treating 
interference as noise achieves the sum-rate capacity. In addition, the optimal covariance matrices can be 
singular for this special case. Using the result of [25] that beamforming is optimal for the single-user 
detection rate region of the multiple-input single-output (MISO) IC, [22] derived noisy-interference sum- 
rate capacities for symmetric MISO ICs, i.e., the H,, j = 1, ■ ■ ■ ,4, are all row vectors with Hi = H4, 
H2 = H3 and the two users have identical power constraints. 

In this paper, we use the covariance matrix constraint (2) and derive the sum-rate capacity of the MIMO 
IC with noisy interference, mixed aligned interference, as well as one-sided interference. The capacity 
regions of the MIMO IC with very strong interference and aligned strong interference are also obtained. 
For all the results, Sj, % = 1, 2, can be any positive semi-definite matrix, and the channel matrices Hj, 
j = 1, ■ ■ ■ ,4, can be singular or non-square unless otherwise specified. 

The rest of the paper is organized as follows: we present our main results and numerical examples in 
Sections II and III, and the proofs of the main results are given in Section IV. 

Before proceeding, we introduce some notation that will be used in the paper. 

• Italic letters (e.g. X) denote scalars; and bold letters x and X denote column vectors and matrices, 
respectively. 
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• I denotes the identity matrix and denotes the all-zero matrix. 

• |X|, Xt and X -1 denote respectively the determinant, conjugate transpose, and inverse of the matrix 
X, and ||x|| denotes the Euclidean vector norm of x. 

• radius(X) is the numerical radius [26, p.g. 321] of the square matrix X, and is defined as 



radius(X) = max abs(a^Xa) 



where a. is a complex vector, and abs(-) denotes the absolute value. 



•^i 5 "^2 ' ■ ■ ■ ' ^ n 



t 



is a long vector which consists of a sequence of vectors Xi, i = 1, . . . , n. 

• x ~ CA/(0,£) means that the random vector x has the circularly symmetric complex Gaussian 
distribution with zero mean and covariance matrix S. 

• E[] denotes expectation; Cov(-) denotes covariance matrix; /(•; •) denotes mutual information; h(-) 
denotes differential entropy with the logarithm base e, and log(-) = log e (-). 



II. Main Results 

In this section, we give the capacity regions for MIMO ICs under very strong interference and aligned 
strong interference, and the sum-rate capacities for MIMO ICs under Z interference, noisy interference 
and mixed interference. 

For economy of notation, we introduce a set of matrices 

Bi = |b j all columns of are in the null space of S« | , i = 1, 2, (3) 

i.e., each column of is either a zero vector, or an eigenvector of the covariance matrix constraint 
Si associated with the zero eigenvalue (if Sj has one). This condition is equivalent to the condition 
S»Bt = 0. 



A. Capacity region of MIMO IC under very strong interference 

We begin with the result for the MIMO ZIC (MIMO IC with one-sided interference) with very strong 
interference. 

Theorem 1: For the MIMO IC defined in (1) with H 3 = 0, if 



log 



I + HiSiH{+H 2 S 2 h5 —log I + HiSiHj > log 



I + H 4 S 2 H 



then the capacity region of the MIMO IC is 



{{Ri, 



< Ri < log 



I + HiSiH{ 



< R 2 < log 



I + H4S2HI 



(4) 



(5) 
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where Si and S2 are the respective covariance matrix constraints defined in (2). 

We say that a MIMO ZIC has very strong interference if (4) is satisfied. In this case the interference 
does not reduce the capacity region. Theorem 1 can be easily extended to obtain the capacity region for 
a two-sided MIMO IC under very strong interference. 

Theorem 2: For the MIMO IC defined in (1) and H 2 / and H 3 / 0, if 

log|l + HiSiH{| > log 
log I + H 4 S 2 Hj > log 



log 
log 



I + HiSiHl + HaSaH 



I + H 3 S 1 Hj + H 4 S 2 Hi 



i + h 4 s 2 h| 

I + HiSiH} 



(6) 
(7) 



i + h 4 s 2 h+|} 



(8) 



then the capacity region of the MIMO IC is 

R 2 ) : < i?i < log I + HiSiHl , < R 2 < log 

where S 4 and S 2 are the respective covariance matrix constraints defined in (2). 

Inequalities (6) and (7) are the very strong interference conditions for a two-sided MIMO IC, which 
means that when both users transmit at the maximum rate, both receivers can first decode the interference 
by treating the desired signal as noise, i.e., we have 

I{x*2,v\) > I{x* 2 ;y* 2 I x*) and 

I(x\\vt) > 1 {x* 1 \y* l I x* 2 ), 

where x* ~ CA/(0,Sj) and y* is defined in (1) with X{ replaced by x*, i = 1,2. As with the scalar 
Gaussian IC where the notion of very strong interference depends on both the channel coefficients and 
power constraints, for the MIMO IC our definition of very strong interference involves both the channel 
matrices and the covariance matrix constraints. Let Hi = H 4 = 1, H 2 = yfa, H3 = Vb, Si = Pi and 
S 2 = P 2 , then (6) and (7) become a > 1 + -Pi and b > 1 + P 2 , respectively. Therefore, Theorem 2 
generalizes the capacity region for scalar Gaussian ICs under very strong interference [2]. 

We remark that an alternative definition of MIMO with very strong interference is to use the power 
constraints instead of the the covariance matrix constraints. The conditions as well the corresponding 
capacity region have the same expression as that of Theorem 2 except that Si and S 2 are now replaced 
with the waterfilling covariance matrices for the two intended links in the absence of interference. This 
alternative definition gives a capacity region that is a superset of that defined using the covariance matrix 
constraints with the trace of the covariance matrices being equal to the power constraints. This alternative 
definition also includes the scalar Gaussian ICs under very strong interference as its special case. 
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B. Capacity region of MIMO IC under aligned strong interference 

We begin with the result for the MIMO ZIC under aligned strong interference. 

Theorem 3: For the MIMO IC defined in (1) with H3 = 0, if there exist matrices A and B such that 



H 4 = AH 2 + B, 

where A*A ^ I and B G B 2 , then the capacity region of the MIMO IC is 



(9) 



< Ri < log 
< R 2 < log 
R l + R 2 < log 



I + H4S1H} 



I + h 4 s 2 h+ 



(10) 



I + HiSiHj + H2S2H2 
where Si and S2 are the respective covariance matrix constraints defined in (2). 

Theorem 3 gives the capacity region of a MIMO ZIC under aligned strong interference. If S2 is 
singular, then (9) means that all the columns of il\ — H^A^ are either zero vectors or the eigenvectors of 
S2 associated with eigenvalue 0. If S2 is nonsingular, then H4 = AH2, i.e., H4 is a linear transformation 
of H2. Therefore, users 1 and 2 see x 2 in the forms of H2X2 and AH2X2, respectively. If A^A ^ I, 
then user 1 can decode x 2 if user 2 can. 

The following is a special case of Theorem 3 where we can choose A explicitly as 

A = (H 4 -B)(h^H 2 ) _1 H 2 . (11) 

0, if H2 is left-invertible, i.e., has full 



Proposition 1: For the MIMO IC defined in (1) with H 3 
column rank, and there exists B G B 2 such that 

H^H 2 h (H 4 - B) f (H 4 - B) , 



(12) 



then the capacity region of the MIMO IC is given by (10). 

By choosing Bj = 0, (12) becomes H2H2 h H4H4, which is related only to H2 and H 4 and directly 
mimics that of the scalar Gaussian IC. 

Using Theorem 3, we obtain the capacity region for the two-sided MIMO IC under aligned strong 
interference. 

Theorem 4: For the MIMO IC defined in (1), if there exist matrices Ai, A2, Bi and B2 such that 



Hi = A1H3 + Bi and 
H 4 = A2H2 + B2, 



(13) 
(14) 
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where A^Aj ^ I and Bj 6 Bi, i = 1,2, then the capacity region of the MIMO IC is 



I + HiSiH{ 



I + H 4 S 2 H 



I + HiSiHj + H 2 S 2 H| 
I + H3S1I4 + h 4 s 2 hJ 



(15) 



0< Ri< log 
0< R 2 < log 
Ri + R 2 < log 
Ri + R 2 < log 

where Si and S2 are the respective covariance matrix constraints defined in (2). 
Similarly to Proposition 1, we have the following proposition. 

Proposition 2: For the MIMO IC defined in (1), and where the channel matrices H2 and H3 are both 
left-invertible, if there exist Bj G Bi, i = 1, 2, such that 

Hj,H 2 y (H 4 - B 2 ) f (H 4 - B 2 ) and (16) 
H+H3 h (Hi - (Hi - Bi) , (17) 

then the capacity region of the MIMO IC is given by (15). 

Obviously, Proposition 2 generalizes the capacity region of the scalar Gaussian ICs under strong 
interference. Furthermore, Proposition 2 also generalizes the result of [18] for single-input multiple- 
output (SIMO) ICs under strong interference. In this case, H 2 and H3 are both non-zero column vectors, 
and hence are left-invertible. Therefore, (16) and (17) become H2H2 h H4H4 and H3H3 y H|Hi 
which are the same as ||H 2 || > ||H 4 || and 1 1 H3 1 1 > ||Hi||. 

Let Bi = B 2 = and assume that there exist Ai and A 2 satisfying (13) and (14). We can verify 
Theorem 4 in a way similar to that done in [3] and [4] for scalar Gaussian ICs under strong interference. 
Assuming the rate pair (Ri,R 2 ) is achievable, then X\ and x 2 can be reliably recovered at user 1 and 
user 2, respectively. After subtracting x\ from y lt user 1 obtains 

yi = H 2 x 2 +^i. (18) 

We can pre-multiply y[ by A 2 and get 

y'{ = A 2 H 2 x 2 + A 2 zi 

= H 4 x 2 + A 2 2,. (19) 
Since X\ is recovered at user 1, we can add H3X1 to (19). Thus user 1 can eventually compute 

VT = H 3 xi + H 4 x 2 + A 2Zl . (20) 
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If A|A 2 ^ I, by Lemma 6 we have A 2 A 2 d: I and the received signal at user 2 can be written as 

2/2 = H3S1 + H4X2 + Z-2 

= y'i+w, (21) 

where w ~ CN (o,I- A 2 A 2 ), and w is independent of all other random vectors. Since x 2 can be 
recovered from y 2 , x 2 can also be recovered from y 1 ". Thus, user 1 can decode both X\ and x 2 . Similarly, 
user 2 can also decode both X\ and x 2 . Therefore, the MIMO IC is now a compound MIMO multiple- 
access channel, whose capacity region coincides with (15) [27]. The above development imposes no 
structure on x,. % = 1,2. Therefore, as long as the input signal x, (which can be non-Gaussian with 
arbitrary covariance matrix) can be decoded by its desired receiver, it can also be decoded by the other 
receiver. This result applies to MIMO ICs under a variety of power constraints, for example, peak power 
constraints, average power constraints and per-antenna power constraints. We state this formally in the 
following proposition. 

Proposition 3: For the MIMO IC defined in (1) with expected per-symbol power constraints, or 
expected block power constraints, or per-antenna expected block power constraints, if there exist matrices 
Aj, i = 1, 2, such that AjAj < I and 



Hi = A1H3 
H 4 = A 2 H 2 , 
then the capacity region of the MIMO IC is 

< Ri < log 



and 



(22) 
(23) 



u 

(s 1 ,s 2 )ep 



I + HiSiH^ 



I + H 4 S 2 H 



I + HiSiHj + H 2 S 2 H 2 
I + H 3 SiH5 + H 4 S 2 H^ 



(24) 



< R 2 < log 
R l + R 2 < log 
R x + R 2 < log 
where V denotes the specified power constraints. 

For this result, we say that there is an expected per-symbol power constraint, an expected block power 
constraint, and a per-antenna expected block power constraint, respectively, if the following conditions 
must be satisfied: 



tr (E [xijxj^j < P u j = l,- 
< nPj or 



,n, 



X ij X ij 



(25) 
(26) 
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n 

Yl { E Xi i x h ) k - nPik > 



(27) 



where (■)& denotes the kth diagonal element of a square matrix, and is the power constraint for the 
kth antenna of user i. 

Theorem 4 has relaxed conditions on the channel matrices as compared to Proposition 3. The extra 
term Bj in Theorem 4 results from the covariance matrix constraint Sj. Suppose (13) and (14) hold and 
the input signal of user i is x* ~ CM (0, Sj). From Theorem 4, x* achieves the capacity. Applying the 
same procedure in (18)-(20) to y t , we obtain the counterpart of (19) 

y" = A 2 H 2 x 2 + A 2 Zi 

= (A 2 H 2 + B 2 )a^ + A 2 zi 
= H 4 x 2 + A 2 zi, 

where the second equality holds since 

Cov(B 2 x^) = B 2 S 2 B^ = 0, 



(28) 



(29) 



and hence B 2 x 2 = 0. Therefore, y 2 can also be written as (21). 

The difference between Proposition 3 and Theorem 4 is that (22) and (23) ensure that Xi can be reliably 
decoded at both receivers as long as it can be decoded at the desired receiver, while (13) and (14) ensure 
that the capacity-achieving x* can be reliably decoded at both receivers. 



C. Sum-rate capacity of MIMO IC under noisy interference 

In [10], we say that an IC has noisy interference when treating interference as noise achieves the 
sum-rate capacity. In this section, we present the sum-rate capacity results for MIMO ICs that have noisy 
interference. 

Theorem 5: For the MIMO IC defined in (1) with H3 = 0, if there exist matrices A and B that satisfy 

H 2 = AtH 4 + B, 



where At A ^ I and Bg6 2 , then the sum-rate capacity of the MIMO IC is 



log 



I + HiSiH{ (l + H 2 S 2 B4) 



log I + h 4 s 2 h| 



(30) 



(31) 



where Si and S 2 are the respective covariance matrix constraints defined in (2). 
Similarly to Proposition 1, we obtain the following result. 
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Proposition 4: For the MIMO IC defined in (1) with H3 = 0, if H4 is left-invertible and there exists 
B G £>2 such that 

H4H4 h (H 2 - B)t (H 2 - B) , (32) 

then the sum-rate capacity of the MIMO IC is given by (31). 

Theorem 5 gives the noisy-interference sum-rate capacity of a MIMO ZIC. Specifically, when (30) 
is satisfied, the sum-rate capacity can be achieved by treating interference as noise. Consider a scalar 
Gaussian IC where Hi = H4 = 1, H2 = \fa and H 3 = 0. Equation (30) is < a < 1. Therefore, 
Theorem 5 includes the scalar Gaussian ZIC noisy -interference sum-rate capacity as a special case 1 . 
For a SIMO IC where Hi, H3 and H4 are all nonzero column vectors, Proposition 4 shows that if 
IIH2II < ||H4||, the sum-rate capacity is achieved by treating interference as noise. 

Similarly to Proposition 3, if we choose B = in (30), then Theorem 5 can be extended for different 
power constraints. We state this formally in the following proposition. 

Proposition 5: For the MIMO IC defined in (1) with expected per-symbol power constraints, or 
expected block power constraints, or per-antenna expected block power constraints, if H3 = and 
there exists a matrix A such that AU^I and 

H 2 = A^, (33) 

then the sum-rate capacity is 

max I log 

(s 1 ,s 2 )ep V 

where V denotes the specified power constraints. 

Next, we give the noisy-interference sum-rate capacity of a two-sided MIMO IC. Note that this result 
does not require Si or S 2 to have full rank (see [22] and Example 4 below). 

Theorem 6: For the MIMO IC defined in (1), if there exist matrices Aj, Bj 6 Bi, and Hermitian 
positive definite matrices S«, i = 1, 2, such that 

A* Ai ^ Ei ^ I - A2S2 1 A\, (35) 
AjA 2 dS 2 ^I-AiSj- 1 A}, (36) 
H 3 = A} (h 2 S 2 h£ + i) ~* Hi + Bi and (37) 

'The case with a < 1 is often referred to as ZIC with weak interference in the literature. We use the term noisy-interference 
simply because of the fact that treating interference as noise achieves the sum-rate capacity. 



+ H1S1HI (l + H 2 S 2 H+y 



+ log 



I + H 4 S 2 H 



(34) 
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H 2 = Aj (H3S1HJ + i) 1 H 4 + B 2 , (38) 
then the sum-rate capacity of the MIMO IC is 



log 



I + HiSxH^ (l + H 2 S 2 H 2 ) 1 + log I + H 4 S 2 H| (i + H3S1HJ) 



(39) 



where Si and S 2 are the respective covariance matrix constraints defined in (2). 

Theorem 6 gives sufficient conditions for the MIMO IC under which treating interference as noise 
achieves the sum-rate capacity. In the case where both Hi and H4 are left-invertible, the following 
conditions are sufficient for (37) and (38): 

Ai = (i + H 2 S 2 H3,) Hi (h}Hi) ~* (h 3 - B{) and (40) 

A 2 = (i + HaSiHj) H 4 (h+H 4 ) ~* (h 2 - B 2 ) . (41) 

That is, such matrices Ai and A 2 exist when Hi and H4 are left-invertible. It remains to find matrices 
Bi G B\ and B 2 G £> 2 such the matrix inequalities (35) and (36) have solutions. We state this formally 
in the following proposition. 

Proposition 6: For the MIMO IC defined in (1), if Hi and H4 are left-invertible, and there exist 
symmetric positive definite matrices Si and S 2 that satisfy (35) and (36) with Ai and A 2 defined in 
(40) and (41) for some Bi G B\ and B 2 G £> 2 , then the sum-rate capacity is given by (39). 

Although Theorem 6 gives the noisy interference conditions for a MIMO IC, finding explicit solution 
of the matrix inequalities (35) and (36) can be very complex. Therefore, using Theorem 6 to check 
whether a MIMO IC has noisy interference is not practical. We thus derive the following proposition 
that is a special case of Theorem 6 but is more amenable to computation. 

Proposition 7: For the MIMO IC defined in (1), the sum-rate capacity is given by (39) if 

radius (*i)<-, * = 1,2, (42) 

where 

*i = (i - A}Ai - A 2 A 2 ) _i a{a1, (i - AjAi - A 2 A 2 )" 3 (43) 

* 2 = (i - AiAj - A 2 A 2 ) " AjAj (i - AiA} - Aj A 2 ) ~' 2 , (44) 

and Ai and A 2 are chosen to satisfy (37) and (38) respectively, and Bj G Bi, i = 1, 2. 

In the scalar case, if we have Hi = H 4 = 1, H 2 = y/a, H3 = y/b, Si = Pi and S 2 = P 2 , from (42) 
we directly have 

Va(l + bPy) + Vb(l + aP 2 ) < 1. 

July 2, 2009 DRAFT 



12 



The above condition can also be obtained from Theorem 6 after some mathematical manipulations. 
Therefore Theorem 6 and Proposition 7 generalize the noisy-interference sum-rate capacity of the scalar 
Gaussian IC [10]-[12] to the MIMO IC. 

Similarly to Proposition 6, we obtain the following proposition. 

Proposition 8: For the MIMO IC defined in (1), if both Hi and H4 are left-invertible, and there exist 
matrices Bi G B\ and B2 £ B2 such that the Ai and A2 defined in (40) and (41) satisfy (42), then the 
sum-rate capacity is (39). 



D. Sum-rate capacity of MIMO IC under mixed aligned interference 

Theorem 7: For the MIMO IC defined in (1), if there exist matrices Ai, A2, Bi and B2 that satisfy 



Hi = A1H3 + Bi and 
H 2 = A 2 H 4 + B 2 , 

where AjAj ■< I and Bj 6 Bi, i = 1, 2, then the sum-rate capacity of the MIMO IC is 

log 



(45) 
(46) 



mm < 



log 



I + H 3 SiH| + H 4 S 2 H^ 



I + H1S1H} (i + h 2 s 2 hJ) 



+ log 



I + H4S2H 



(47) 



where Si and S2 are the respective covariance matrix constraints defined in (2). 

Proposition 9: For the MIMO IC defined in (1) where H3 and H4 are left-invertible, if there exist 
Bj £ Bi, i = 1, 2, such that 



H+H 4 y (H 2 - B2) 1 " (H 2 - B 2 ) 
H3H3 y (Hi - Bi)^ (Hi - Bi) 



and 



(48) 
(49) 



then the sum-rate capacity is given by (47). 

Theorem 7 gives the sum-rate capacity of the MIMO IC under mixed aligned interference, i.e., one 
user sees aligned weak/intermediate interference and the other user sees aligned strong interference. 
The sum-rate capacity is achieved by treating interference as noise at the receiver that sees aligned 
weak/intermediate interference, and fully decoding the interference at the receiver that sees aligned strong 
interference. Proposition 9 includes the sum-rate capacity of scalar Gaussian ICs with mixed interference 
as a special case. If we choose Bi = and B2 = 0, the constraints (48) and (49) reduce to H4H4 y 
H2H2 and H3H3 y H|Hi. The MIMO ICs that satisfy these two simplified conditions have mixed 
interference and this result applies to channels with other power constraints. 
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Similar to Propositions 3 and 5, we obtain the sum-rate capacity for MIMO ICs with aligned mixed 
interference under different power constraints. 

Proposition 10: For the MIMO IC defined in (1) with expected per-symbol power constraints, or 
expected block power constraints, or per-antenna expected block power constraints, if there exist matrices 
Aj, i = 1, 2, such that A J A, < I and 



then the sum-rate capacity is 



max mm < 
(Si,S 2 )eP log 



H 2 = 

log 



Ai H 3 and 



A'H 4 , 



I + H3S1H3 + H 4 S 2 HT 



1 + 



HiSiHj (i + H 2 S 2 Hj) 



+ log 



I + H 4 S 2 Hj 



where V denotes the specified power constraints. 



(50) 
(51) 



(52) 



E. Generalizations 

The results in the previous sections are for MIMO ICs whose capacities are achieved by Xi ~ 
CM (0, Si), i = 1, 2, where S« is the covariance matrix constraint for user i defined in (2). The methods 
introduced can also be applied to more general cases in which the capacity is achieved by x\ ~ CM (0, S£) 
where S£ -< Sj. For example, consider the following generalization of Theorem 5 that gives the sum-rate 
capacity of a class of MIMO ZICs under noisy interference. This extension applies to all the corresponding 
theorems for other kinds of interference. 

Theorem 8: For the MIMO IC defined in (1), if H 3 = and the optimal A*, and S?j for the 
following optimization problem 



min max C ( A, Si , S 2 ) 

A §!,§„ v ' 

subject to AA 1 " < I, ■< §1 ■< Si, < S 2 < S 2 



satisfy 



H 2 = A* t H 4 + B, 



(53) 



(54) 



where 
c(A,Si,S 2 ) =log 

+ log 



HiSiHl + H 2 S 2 H 2 + I 



- log 



I- AA+ 



H 4 S 2 H^ + I - (H 4 S 2 H^ + A) (h 2 S 2 H^ + 1) 1 (H 2 S 2 H^ + A f ) 



, (55) 
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and 



B G |b all columns of are in the null space of S 2 j , 
then the sum-rate capacity for the MIMO IC is 



log 



i + His;h{ (i + h 2 s^h+) 1 + log |i + n 4 s* 2 u\ 



(56) 



(57) 



The solution of problem (53) is an upper bound on the sum-rate capacity of this MIMO ZIC. The 
bound is tight when (54) is satisfied. Theorem 8 includes Theorem 5 as a special case in which Si and 
S2 are optimal for problem (53). 

III. Numerical Results 
Example 1: Consider a MIMO IC with 





2.0 


1.5 




1.2 


2.0 


Hi = H 4 = I, H 2 = 






, H 3 = 








0.8 


1.0 







0.8 



and Si = S 2 = I. 

Conditions (6) and (7) are satisfied. Therefore this MIMO IC has very strong interference and the capacity 
region is 

{(Ri,R 2 ) : < i?i < 1.3863, < R 2 < 1.3863} . 

However, consider the aligned strong interference conditions (13) and (14) for this channel. We have 
Ai = HJ \ A 2 = H^ 1 and Bi = B 2 = 0, where A|Ai ^ I and A^A 2 ^ I. Therefore, the above 
channel has very strong interference but not aligned strong interference. 
Example 2: Consider a MIMO IC with 

0.8 1.0 -0.5 0.6 



Hi 



H a 



Si 



1.8 


0.8 


-0.6 


1.4 


1.2 


-1.9 


0.5 


-0.7 


1.0 


1.0 


0.5 0.5 






0.4 


0.2 


1 0.6 






0.9 


0.4 


1.0 0.1 




0.4 


0.4 


-0.4 




1.0 





2.0 1.0 




0.1 


-0.4 


1.0 0.9 





H, 



H, 



1.0 -1.2 0.4 1.2 

0.68 0.36 -0.22 0.6 
1.04 -0.66 0.17 1.14 



and S 2 = I. 
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Conditions (13)-(14) are both satisfied by choosing 



0.8 
0.5 



0.6 0.2 
0.3 0.8 



10-11 
1-2 0-1 



and B 2 = 0. 



By Theorem 4, this MIMO IC is under aligned strong interference and the capacity region is 
{(Ri, Ba) ■ < i?i < 1.6770, < R 2 < 1.8636, < R x + R 2 < 3.2812} 



Example 3: Consider a MIMO ZIC where 

1.3 1.1 1.4 
Hi I. H 2 = 1.5 -0.5 3.0 
0.9 -0.36 1.5 



H, 



0, H 4 



1.0 2.0 0.5 
1.0 1.0 2 
0.5 0.4 0.5 



Si = I and S 2 



1.8 1.0 -0.4 

1.0 5.0 2.0 

-0.4 2.0 1.2 
Condition (30) is satisfied by choosing 





0.8 










0.5 


-0.5 


1.0 


A = 





0.5 





and B = 


1.0 


-1.0 


2.0 










0.6 




0.6 


-0.6 


1.2 



By Theorem 5 or Proposition 4, the above MIMO ZIC is under noisy interference and the sum-rate 
capacity C = 5.6622 is obtained from (31). 
Example 4: Consider a MISO IC with 

6.0 4.0 5.0 

0.5 0.8 1.0 

0.4 0.6 0.1 

3.0 -2.0 6.0 



Hi 




H 2 




H 3 




H 4 





Si 



0.9 0.5 
0.5 2.5 
-0.2 1 



-0.2 
1 

0.6 



and S 2 



2.2 -0.2 -0.6 
-0.2 0.2 -0.4 
-0.6 -0.4 1.3 



Condition (42) is satisfied by choosing 

Ai = 0.1578, A 2 = 0.2394, B x 



-0.2,0.2,-0.4] and B 2 = [0.2, 1.0, 0.4]. 



By Proposition 7, this MISO IC is under noisy interference and the sum-rate capacity C = 7.7171 
is achieved by treating interference as noise. In this case rank (Si) = rank(S 2 ) = 2. However, if we 
use average power constraints P\ = tr(Si) = 4.0 and P 2 = tr(S 2 ) = 3.7 instead of the covariance 
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matrix constraints Si and S2, then using the optimality of beamforming for single-user detection of 
MISO ICs [25], we can achieve a sum rate of R± + P 2 = 9.9162 by treating interference as noise 
and choosing Sj = 7^7+, rank (Si) = 1, % = 1,2, where 7l = [1.2133,-0.0181,1.5899]+ and -y 2 = 
[0.5673, -1.4460,1. 1345] t. 
Example 5: Consider a MIMO IC under average power constraints Pi = 8 and P2 = 1 with 

Hi = diag[1.0392, 1.5937, 1.2689], H 2 = diag[0.7746, 0.2646, 0.3162], 
H 3 = diag[0.3000, 0.6083, 0.3162] and H 4 = diag[1.5330, 1.2124, 1.3784]. 

Since all the channel matrices are diagonal, this MIMO IC can be considered as a parallel IC. From 
[24, Theorem 3], this MIMO IC is under noisy interference and the sum-rate capacity C = 6.1066 can 
be achieved by independent coding across antennas and treating interference as noise. The optimal input 
signals are Gaussian with covariance matrices 

Si = diag [2.0922, 3.3021, 2.6057] and S 2 = diag [0.4472, 0, 0.5528] , 

where tr (S\) = Pi and tr (S 2 ) = P 2 . The input covariance matrix of the second user is singular and the 
second antenna is inactive. 

If the average power constraints Pi and P 2 are replaced by covariance constraints: 



Si 



2.0922 0.5000 1.0000 
0.5000 3.3021 
1.0000 2.6057 



and S 2 



0.4472 0.1500 


0.1500 0.5528 



where tr(Si) = Pi and tr(S 2 ) = P 2 but Si ^ Si and S 2 ^ S 2 . Conditions (40) and (41) are satisfied by 
choosing 



0.3661 0.0092 
A ; - 0.3817 A , - 

0.0106 0.2630 
It can be obtained from (42) that radius ($1 



0.6004 0.0199 0.0218 
0.0461 0.4848 
0.0479 0.2892 
0.4614 and radius(* 2 ) 



and Bi = B 2 = 0. 



0.1822. Therefore, from 



Proposition 7 this MIMO IC is under noisy interference and the sum-rate capacity C = 5.9541 is 
achieved by treating interference as noise. 



IV. Proofs of the Main Results 
We first introduce some lemmas which will be used to prove our main results. 
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A. Preliminaries 

The following lemma is based on the fact that a Gaussian distribution maximizes conditional entropy 



under a covariance matrix constraint [28]. 



Lemma 1: Let x[ l = 



x 



t 



x ] 



t 



, i = 1, . . . , k, be k long random vectors each of which consists 



of n vectors. Suppose the Xij, % = 1, ■ ■ ■ , k all have the same length Lj, j = 1, • • • , n. Let y r 
, where y has length Lj, be a long Gaussian random vector with covariance matrix 



Cov(y") = ^A l Cov(x"), 



(58) 



where Yli=i = 1> ^ — and |Cov (x™)| > 0. Let 5 be a subset of {1,2,..., n} and T be a subset of 
5's complement. Then we have 



J2 Xih ( Xi < s l Xi - r ) ^ h (Vs \Vt ) ■ 



(59) 



i=i 



Proof: See Appendix A. 

When the Xk, k = 1, ■ ■ ■ , n are all Gaussian distributed, Lemma 1 shows that h (xs \xt) is concave 
over the covariance matrices. 

Lemma 2 includes some special cases of Lemma 1 . 

Lemma 2: Let x k = {xi, ■ ■ ■ ,Xk} and y k = {y 1 , ■ ■ ■ ,y k } be two sequences of random vectors, and 
let x*, y* , x* and y* be Gaussian vectors with covariance matrices satisfying 



Cov 



x 



■ — He. 

y 



l 



i=l 



Xi 




X* 




r< Cov 




Vi 




T. 



Then we have 



h (x k j < k- h(x*) < k- h(x*) and 

h (y k \x k ^j < k ■ h (y* \x*)<k-h (y* \x* ) . 



(60) 

(61) 
(62) 



Proof: See Appendix B. 

Lemma 3: Let x n = {x\, ■ ■ ■ ,x n } be a sequence of n random vectors and let x* and x* be Gaussian 
random vectors with covariance matrices 

1 n 

Cov (x*) = - Cov (xi) r< Cov (x*) . (63) 
n i=i 



July 2, 2009 



DRAFT 



18 

Let z and z be two independent Gaussian random vectors and z n and z n be two sequences of random 
vectors each independent and identically distributed (i.i.d.) as z and z, respectively. We have 

h (x n + z n )-h (x 11 + z n + z n ) < nh (x* +z)-nh(x* +z + z) (64) 

< nh (i* + z) - nh (x* + z + z) . (65) 

Proof: See Appendix C. 

Lemma 4: [29, page 107] [30] Let x,y and z be joint Gaussian vectors. If Cov(y) is invertible, then 
x — > y — ► z forms a Markov chain if and only if 

Cov (x,z) = Cov (x.y ) Cov (y) _1 Cov (y, z) . 
Using Lemma 4 we obtain the following lemma. 

Lemma 5: Let x, u and v be jointly Gaussian vectors, such that x is independent of u and v. Denote 
Cov (x) = S x , Cov (u) = S„ and Cov («,«) = S„ r . If S„, is invertible, then x — > Hx + « — > Gz + v 
forms a Markov chain if and only if 

SajG 1 " = S a; H 1 'S~ l S u „. (66) 



Proof: See Appendix D. 
I A 



Lemma 6: ^ if and only if B y A 1 " A. If B >- 0, then B >z A f A if and only if 

At B 

I y AB^Al 
Proof: See Appendix E. 

Lemma 7: If B is left-invertible (or B+B is invertible) and A = B (B^B) -1 C 1 ", then A* A ^ I or 
AAt r< I if and only if B^B h 
Proof: See Appendix F. 

Lemma 8: [31, Theorem 5.2] Suppose W is nonsingular and M is positive definite. Then the matrix 
equation 

X + WtX^W = M 
has a positive definite solution X if and only if 

radius (]VT 2\V]Vr 2 ^ < _ 



July 2, 2009 



DRAFT 



19 



Using Lemma 8, we obtain necessary and sufficient conditions for a pair of matrix equations to have 
positive definite solutions. 

Lemma 9: Suppose Ai and A2 are fixed, and I is the identity matrix, the following matrix equations 



Si = I - A 2 S 2 1 Aj and 



have positive definite solutions Si >- A}Ai and S 2 >- A 2 A 2 if and only if 



S 2 

t. 



I-AiS^Al, 



radius < -, i = 1,2, 



where 



*i = (i - A{Ai - A 2 A 2 ) ~ 2 a\a\ (l - A}Ai - A 2 A| 
* 2 = (i - AiAj - AjA 2 )~ 3 a\a{ (l - A ± A\ - A 2 A 2 y 



and 



(67) 
(68) 

(69) 

(70) 
(71) 



Proof: See Appendix G. 
B. Proof of Theorem 1 

The converse follows by giving receiver 1 the message not destined for it and applying the maximum- 
entropy theory to show that Gaussian input distributions are optimal. To prove achievability, let x± ~ 



CjV(0,Si) and a; 2 ~ CM(0,S 2 ), and let user 1 transmit at rate R\ = log 



2 transmit at rate i? 2 = log 



I + H 4 S 2 H 



I + HiSiH} 



, and user 



Inequality (4) guarantees that user 1 can first decode x 2 
by treating X\ as noise. After the interference is subtracted, user 1 sees a single-user Gaussian MIMO 
channel. Therefore, the rate region (5) is achievable. 

C. Proof of Theorem 2 

Similarly to the proof of Theorem 1, the converse follows by giving each receiver the message not 
destined for it and applying the maximum-entropy theory to show that Gaussian input distributions are 
optimal. To prove the achievability, let X\ ~ CM (0, Si) and x 2 ~ CM (0, S 2 ), and let user 1 transmit at 



rate R 



log 



I + HiSiHl 



I + h 4 s 2 h1 



. Inequalities (6) and 



, and user 2 transmit at rate 7? 2 = log 
(7) guarantee that each user can first fully decode the interference by treating the desired signals as noise. 
After the interference is subtracted, each user sees a single-user Gaussian MIMO channel. Therefore, the 
rate region (8) is achievable. 
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Z-2 


~ CM |o, 


I A 






) 


n 




At I 





D. Proof of Theorem 3 and Proposition 1 

Suppose the channel is used n times. The transmitted and received vector sequences are denoted by 
xf and yf for user i, i = 1,2, and xf satisfies (2). 

Since A T A < I, from Lemma 6, there exists a Gaussian random vector n whose joint distribution 
with 22 is 

zo\ i r i a! \ 

(72) 

Moreover, from (9), n is of the same dimension as z\ hence has the same marginal distribution as z\. 
Let e > and e — > as n — > +oo, From Fano's inequality, any achievable rates must satisfy 

+ R2) — ne 
<I(x1;yV + I(x%;y2) 

< I(x';:y';)~ I(x!j:y!l.H->x!j+n") 

= h (Hiz',' + H,x'i + 2?) - /t (H 2 x£ + «?) + /i (H 2 a# + n n ) - /i (n n ) 

+/i (H 4 x£ + 2^ I HsXo 1 + n n ) -fc(*2 I n n ) 
= / (x?, x£; B^x? + Hsxa 1 + 2?) + /i (H 4 x£ + 2^ | H 2 x^ + n n ) - h {z'2 I n n ) 

(b) 

< /(x™,X2;Hix™ + H2X2 +2™) + n/i (H4X2 + 2 2 | H2X2 + n) - n/i (z 2 I n) 

= J (x?,x£; Hix? + H 2 x 2 l + 2?) + nh (H 4 x* 2 + 2 2 | H 2 x^ + n,x* 2 ) - nh (z 2 \ n) 

= /(x','.X2':H,x',' -H 2 x 2 ' +2',') 

< nlog|l + HiSiH} + H 2 S 2 H$ 

1 ,2j 2 ,...,2] n and n n = n{,nJ,, . . . ,n\ , i = l,2, and 
are i.i.d. as (72). 

Equality (a) is from the fact that n and z\ have the same marginal distribution. Inequality (b) is by 
Lemma 2, and we let x* ~ CM (0, Sj), i = 1,2. x\ is independent of x 2 * and y* is defined in (1) with 
Xi replaced by x*. Equality (c) is from (9) which means 

S 2 H+ = S 2 (HjAt + Bt) = S 2 HjAt. 

By Lemma 5, x\ — > H2X2 + n — ► H4X2 + 22 forms a Markov chain. 

Therefore, (10) is an outer bound for the capacity region. On the other hand, (10) is also achievable 
by requiring user 1 to decode messages from both users. Therefore, Theorem 3 is proved. 



where 2™ 



(73) 



n, 
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If H2 is left-invertible, we can choose 



At = H 2 (HJ,H 2 ) '(hJ-B*), 



(74) 



so that (9) is satisfied. By Lemma 7, A^A ^ I is equivalent to (12). Thus Proposition 1 is proved. 

E. Proof of Theorem 4 and Proposition 2 

Theorem 4 can be proved by using Theorem 3 twice. To prove a converse, we first remove the 
interference link from transmitter 1 to receiver 2 and obtain a MIMO ZIC with H3 = 0. The capacity 
region of the original MIMO IC is a subset of the capacity region of this MIMO ZIC because we are 
effectively giving user l's message to receiver 2. Theorem 3 gives the capacity region of this MIMO 
ZIC with (14). Similarly, we remove the interference link from transmitter 2 to receiver 1 and obtain a 
MIMO ZIC with H 2 = 0. Theorem 3 gives the capacity region of this MIMO ZIC with (13): 



Thus, the capacity region of the original MIMO IC is included in the intersection of (10) and (75) 
which is (15). On the other hand (15) is achievable by requiring both receivers to decode messages from 
both transmitters, and therefore (15) is the capacity region. 

Proposition 2 is similarly proved as Proposition 1. 

F. Proof of Theorem 5 and Propositions 4 and 5 

Since A^ A ^ I, from Lemma 6 there exists a Gaussian random vector n whose joint distribution with 




< 



i?i < log I + HiSiH} 
R 2 < log I + H 4 S 2 H| 
#1 + R 2 < log I + H3S1I4 + H 4 S 2 Hj ^ 



> . 



(75) 



2 2 IS 




(76) 



Moreover, (30) and (76) mean that n and Z\ have the same dimension and distribution. 



From Fano's inequality, any achievable rates must satisfy 



n(R\ + R2) — ne 



</(x^;^) + /(x^y 2 t ) 



<I(xbyV+I(x2;v2,H 2 x2 + n n ) 
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= h (Hix? + H 2 a;2 + 2?) - h (H 2 ^ + zj) + h (H 2 x T 2 l + n n )-h (n n ) 

+h (H 4 x£ + z£ | H 2 x 2 ' +n n ) - h (z% \ n" ) 
= h (Hix? + Hao;^ + (n n ) + /i (H4X2 1 + | H 2 x^ + n n ) - h{z 2 l \ n n ) 

< nh (HiX* + H2a;2 + Z\) — nh (n) + nh (H4X2 + z 2 \ H 2 x 2 + n) — nh{z 2 \ n) 
= nh (H\X* y + H2X2 + Zi) — nh (n) + nh (H4X2 + z 2 ) + nh (H2X2 + n | H4X2 + z 2 ) 

—nh (H 2 x 2 + n) — nh (z 2 \ n) 

(c) 

= nh (Hix* + H2X2 + Z\) — nh (n) + nh (H4X2 + z 2 ) + nh (H2X2 + n \ H4X2 + z 2 , x 2 ) 

— nh (H2X2 + n) — nh (z 2 \ n) 

= nh (H1X4 + H 2 x-j + Z\) — nh (n) + nh (H4X2 + z 2 ) +nh(n\ z 2 ) — nh (H2X2 + Z\) — nh(z 2 \ n) 

= nh (HiX, + H2X2 + Z\) — nh (H2X2 + Z\) + nh (H4X2 + z 2 ) — nh (z 2 ) 

-1 



nlog 



I + HiSiH{ (l + H 2 S2H 2 



+ nlog 



I + h 4 s 2 hJ 



(78) 



where n n = 



t 



, and the are i.i.d. Gaussian vectors distributed as (76). 
Equalities (a) and (d) are both from the fact that n and Z\ have the same marginal distribution. Inequality 
(b) is from Lemma 2, and we let x* ~ CM (0, Sj), i = 1,2. x* is independent of x 2 and y* is defined 
in (1) with Xj replaced by x*. Equality (c) is from (30) which means 

S 2 H 2 = S 2 HU. 



By Lemma 5, x\ —>■ H4X?; + z 2 — > H2X2 + n forms a Markov chain. 

Since (31) is achievable, the sum-rate capacity is (31) if (30) holds. Therefore, Theorem 5 is proved. 
When H4 is left-invertible, we can choose 



A = H 4 (h|h 4 ) ^H 2 -Btj. (79) 

Then (30) is satisfied. By Lemma 7, A^A ^ I is equivalent to (32), therefore Proposition 4 is proved. 
Proposition 5 is proved in a similar way as Theorem 5. Let Xj ~ CM ^0, S^j, i = 1,2, where 

1 n 

Si = -^2Cov( Xij ). (80) 



rt 



n 



From Fano's inequality, we have 

n(i?i + R 2 ) — ne 

< Kx'^y'n + Kx^y^.H^+n") 
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= h (Hix? + H 2 x£ + 2?) - /i (n n ) + /i (H 4 x£ + z% \ H 2 x£ +n n ) - h{z^\n n ) 

( a ) ^ 

< n/i (H1X1 + H 2 x 2 + «i) — i/i (n) + n/i (H 4 x 2 + 2 2 | H 2 x 2 + n) — n/i (z 2 | ») 

= nh (Hi$i + H 2 X2 + zi) - nh (n) + n/i (H 4 x 2 + 22) + nh (H 2 x 2 + n | H 4 x 2 + 22, ^2 ! 

— nh (H 2 X2 + n) — nh (z 2 \ «) 
= nh ( H ! x 1 + H2X2 + *i) - n/i ( H 2 x 2 + n) +nh (H 4 x 2 + 22) - ra/i (2 2 ) 



log 



+ HiSiHl (i + H 2 S 2 H|) 



-1 



+ log 



I + H 4 S 2 H+ 



where (a) is from Lemma 2; and (b) is from (33) which means S 2 H| = S2H4A and thus by Lemma 5, 
x 2 — > H 4 x 2 + 22 — > H 2 x 2 + n forms a Markov chain. 



rt 



(81) 



G. Proof of Theorem 6 and Proposition 6 

Since there exist 5] 4 and S 2 which satisfy (35) and (36), by Lemma 6, there exist two random vectors 
n 4 and n 2 whose joint distributions with Z\ and 22 are 



CM 0, 



1,2. 



(82) 



and 



(83) 
(84) 



Furthermore, from (35) and (36) we have 

Cov(f»i) ^ Cov (22 I n 2 ) 

Cov(ri2) ^ Cov (21 1 ni ) . 
From Fano's inequality, any achievable sum rate R\ + R 2 must satisfy 
n(Ri + i? 2 ) — ne 
</(*?;»?) + 

< / (ar?; »?, H 3 x? + n?) + 7 (x£; y 2 l , H 2 x^ + n£) 

= h (H3X? + »?) - /»(n?) + Mv? I H 3 x? + n? ) - (H 2 x£ + z? | n? ) + h (H 2 x£ + n£) - h(n$) 
+h (y% I H 2 x 2 + n n 2 )-h (H 3 x? + 2^ | n£ ) , (85) 



where n" 



4,l' n !,2> • • • > n I,ra 



, and the Bjj are i.i.d. Gaussian vectors distributed as (82). Since n±j 



is independent of n^k, and z 2 j is independent of n 2 ,fc> f° r anv J / fr° m (83) we have 

Cov«) ^Cov(2 2 l | ra£). (86) 
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By Lemma 3 we have 

h (H- A x'{ + <) - h (U 3 x r l + z% | n!l ) < nh (H :i x\ +n l )-nh (H^x\ +z 2 \n 2 ), 
where x\ ~ CN (0, Si). Similarly, we have 

h (H 2 x 2 + n 2 ) - ^ ( H 2^2 + «i I «i ) < nh (H2X2 + n 2 ) - nh (R 2 x* 2 + z x | n x ) , 

where x^ ~ CAA(0,S 2 ). 
By Lemma 2 we have 

/i (y? I H 3 x? + n? ) < n/i iy\ \ H { x; + n, ) and 

/i (^2 I H 2 x 2 l + n 2 l ) < nh(y 2 \ H 2 x* 2 + n 2 ), 

where y* is defined in (1) with Xj, j = 1, 2, replaced by x*. 
On substituting (87)-(90) into (85) we have 

R\ + i?2 — e 

< h (H3X1 + m) - h (m) + /i I H3X1 + ni ) - h (H2X2 + «i I »i ) 

+h (H2X2 + n 2 ) - h (n 2 ) + h (1/2 I H2X2 + n 2 ) - /i (H 3 i* + z 2 I 1*2) 
= 7(xi;yi,H 3 x* +m) + 7 (xlj ; 2/2, H 2 x^ +"2) 



(a) 



7(x;;yi) + /(x^:^). 



log 



I + HiSiHj (l + H 2 S 2 H$) 1 + log I + H4S2HI (i + H3S1HS) 



(87) 



(88) 



(89) 
(90) 



(91) 



where (a) is from (37), (38) and Lemma 5 since x* — > y\ — > H3X1 + Bi and X2 — > y 2 — > H 2 x 2 + n 2 
form two Markov chains. 

On the other hand (91) is achievable by treating interference as noise, and therefore (91) is the sum-rate 
capacity. 

Proposition 6 is straightforward from Theorem 6. 



H. Proof of Proposition 7 and Proposition 8 

Since matrices Ai and A2 satisfy (42), by Lemma 9 there exist two Hermitian positive definite matrices 
Si and S 2 that satisfy 

(92) 



a}Ai r< Si 



I - A 2 £ 2 1 A], and 



A+A 2 ^ S 2 = 1- AiS^A}. 



-i A t 



(93) 
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Thus, we see (35) and (36) are satisfied. Since (37) and (38) are satisfied by hypothesis, Proposition 7 
follows by Theorem 6. 

Proposition 8 is straightforward from Proposition 7. 



7. Proof of Theorem 7 and Propositions 9 and 10 

The achievability part is straightforward by letting user 2 first decode the message from user 1 and 
then decode its own message, and by letting user 1 treat signals from user 2 as noise. Then user 1 and 
user 2 have the respective rates 

log 
log 



mm 



-i 



I + H1S1H} (l + H 2 S 2 H 2 ) 
I + HaSiH^ (l + H 4 S 2 Hi) _1 



and 



R 2 = log 



I + H 4 S 2 H 



Therefore, the sum rate (47) is achievable. 

To prove the converse, we first let H 2 = 0. By using (45) and Theorem 3, the sum rate satisfies 



R\ + R2 < min < 



loe 



I + H3S1HJ + H 4 S 2 HT 



log 



1 + HiSiH} 



+ log 



I + H 4 S 2 H 4 



(94) 



Alternatively, we let H3 = 0. By using (46) and Theorem 5, the sum rate also satisfies 



i?i + i? 2 < log 



1 + 



HiSiH^ (l + H 2 S 2 H 2 ) 



+ log 



I + H 4 S 2 H^ 



(95) 



Combining (94) and (95), we have 



Ri + R2 < min < 



loe 



I + H 3 SiH 3 + H 4 S 2 Hj 



i + h 4 s 2 h 4 



> . 



I + h 4 s 2 h 4 



(96) 



log I + HxSiHl (l + H 2 S 2 H 2 ) 1 +log 
log |l + H1S1H} I + log' 

We complete the proof by pointing out that the last line of (96) is redundant because of the second line. 

Proposition 9 is similarly proved by Propositions 1 and 4. Proposition 10 is similarly proved by 
Propositions 3 and 5. 



/. Proof of Theorem 8 

The proof of Theorem 8 follows that of Theorem 5. The bound in problem (53) is derived from (77) by 
assuming x* ~ Following similar steps as in (78), one can verify that the sum-rate 

capacity is (57) if (54) is satisfied. 
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Appendix 

A. Proof of Lemma 1 

Let x* s be a Gaussian vector with covariance matrix Cov(xi 5 s). We have 

(a) 



i=l 



E Ai/i |xi,r) < Yl Xih (x* :S |x* r ) 
i=i 

^A, [/i (x* Sur ) - h (x* r )] 
Gov (< 5u t) 



i=i 

k 



i=l 
fc 



Cov (x* r 



(Tre) 



" h V |Cov(y r 



(7re)" 



= Hvs \ vt), 

where inequality (a) is from [28, Lemma 2], and inequality (b) is from [32, Theorem 17.10.1]. 



(97) 



B. Proof of Lemma 2 

The first inequalities of (61) and (62) are straightforward from Lemma 1. It suffices to prove the second 
inequality of (62). Since (60) holds, we can define two random vectors u and v that are joint Gaussian, 
independent of x* and y* , and satisfy 













X* 




X 




u 








+ 








^* 






V* 




V 




V 



Therefore, 

h (y* \x*)>h iy* \x*,u,v) = h (y* \x* ) . (99) 



C. Proof of Lemma 3 



h {x n + z n )-h (x n + z n + z n ) 

= -I{z n ;x n + z n + ~z n ) 

< -I(z n ;x* n + z n + z n ) 

= -h{z n ) + h{z n \x* n +z n + z n ) 
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(6) 

< — nh (z) +nh(z\ x + z + z) 
= nh (x* + z) — nh(x* + z + z) , 



(100) 



where (a) is from [33, Lemma II.2], and x* n is a Gaussian vector sequence that has the same covariance 
matrix as x n . Inequality (b) is from Lemma 2. Alternatively, we can use Lemma 2 to bound (100) as 

nh (x* + z) — nh(x* + z + z) 
= —nh (z) + nh (z | x* + z + z ) 
< -n/i (z)+nh(z\ x* + z + z) 

= nh(x* +z) -nh(x* + z + z) . (101) 

D. Proof of Lemma 5 

Let the eigenvalue decomposition of S u be 

S u = QAQ f , (102) 

where Q is a unitary matrix and A is a diagonal matrix with strictly positive diagonal elements. Since 

Cov(x,i/)Cov(i/) -1 Cov(i/,jb) = Cov (x, Ay) Cov {Ay)' 1 Cov (Ay,z) (103) 

for any invertible matrix A, we choose A = A~2 Q and then x — > Hx + u — ► Gx + v forms a Markov 
chain if and only if x — Hx + w — > Gx + v forms a Markov chain, where 



H = A 2QH and 
tt = A sQn, 



(104) 



and we have 



Cov (u) = I and 
Cov (u,v) = A~2QS U „ = S„„. 
By Lemma 4, x — > Hx + u — > Gx + v forms a Markov chain if and only if 
S*Gt = S x & (i + HS^Ht) ~* (HS^Gt + S uv ) 

= S x & (l + HS^H 1 ") 1 HS^G 1 + S x & (l + HS^) 1 S 



(105) 



(a) 



I - (i + H^HS^ 1 Gt + S^H* (i + HS X H^ 1 S t 
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(c) 



S X G^ - S x (i + &US X ) ~* G f + S x & (i + HS^Ht) _1 S uv 
S X G^ - (i + S^H)" 1 (s x G^ - S x HtS u „) 

S^Gt - (i + S^H) ~* (s x Gt - SxHtS-^^) (106) 

where (a) is from the matrix inverse identity [34, page 151] 

A (I + BA)" 1 B = I - (I + AB)" 1 . 

Equality (b) is from the matrix inverse identity [34, page 151] 

A (I + BA) -1 = (I + AB) -1 A. 

Equality (c) is from (102), (104) and (105). We complete the proof by pointing out that (106) is equivalent 
to (66). 



E. Proof of Lemma 6 

Let x be a vector with dimension equal to the number of rows of A, and y be a vector with dimension 
equal to the number of columns of A. We have B y At A so that ytBy > y^A^Ay and 

= x j x + y^A^x + x 1 Ay + y^By 

> X s x + y^A^x + x 1 Ay + yt A j Ay 
= (Ay + x) 1 " (Ay + x) 

> 0. 

f I A 



X 


t 


i 


A 




X 


y 




At 


B 




y 



Therefore, sufficiency is proved. On the other hand, if 



y 0, we have 



X 


t 


i 


A 




X 


y 




At 


B 




y 



At B 

x'x + y' A x + x' Ay + y'By > 0. 



We choose x = —Ay and substitute it into (107), then we have 

yt (b - A f A) y > 0. 

Therefore, B y At A. 

If B >- 0, then B y At A is equivalent to 

< y f (b - A f a) y 



(107) 



(108) 
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where 



and 



2/ f B2 (l - B 2 A f AB~2 J B^y 
yt ^i-B-^AtAB-^y, 

B2 = UA2U 1 " and 
y = B2y, 

B = UAU f 



(109) 



is the eigenvalue decomposition of B with U being a unitary matrix and A being a diagonal matrix with 
strictly positive diagonal elements. Since y can be any vector, (109) means 

I y B~2A f AB i 

Suppose that the singular value decomposition of B~2 At is 

S 



B-2A f = P 







where both P and Q are unitary matrices and S is a diagonal matrix with strictly positive diagonal 
elements. Then we have 

r s 



B 2 A T AB 



AB 2B-2A f = Q 





S 




P f and 



Therefore, I y B 2 A^AB 2 if and only if I y ~E which is also the necessary and sufficient condition 
for I y AB 2B 2 A+ = AB x At. 



F. Proof of Lemma 7 

Let A = B (BtB) - Ct and suppose that the singular value decomposition of B is 

S 



B = U 



(110) 
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where both U and V are unitary matrices, and I] is a diagonal matrix with strictly positive diagonal 
elements. Suppose further that 

I y A 1 " A 

= C (BtB^Ct 

= CV5T 2 V t C t . (Ill) 
Lemma 6 implies that X^X ^ I if and only if XX^ ^ I, therefore (111) is equivalent to 

i y s _1 vtctcvs _1 , (112) 

i.e., for any vector x we have 

o < a:* (i - s^vtctcvir 1 ) x 

= x f S 'V f (vS 2 V f - C f c) VS 'a; 

= yt(BtB-Ctc)y, (113) 

where the last line is from (110), and we define y = VS _1 i. Since x can be any vector and X! _1 Vt is 
invertible, y can also be any vector. Therefore, (113) proves Lemma 7. 

G. Proof of Lemma 9 

From (68) and the Woodbury matrix identity [35]: 

(E + CBCt) 1 = E" 1 - E _1 C (b _1 + C^E^Cy 1 C^E -1 , 

we have 

S2 1 = 1- A x (-£1 + A^Ai)" 1 A}. (114) 
Substituting (114) into (67) we have 

£i = I- A 2 Ai, + A 2 Ai (a\ Ai - Si) _1 A{ Aj. (1 15) 



Define 



Xi = Si-AlAi, (116) 
Mi = I - A\ Ai - A 2 Aj, (117) 
M 2 = 1- AiA}- aJa 2 , (118) 
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Wi = a{a5 and (119) 

W 2 = aIa\. (120) 
On substituting (116)-(119) into (115), we have the following matrix equation: 

Xi + w}Xi = Mi. (121) 

Equation (121) is a special case of a discrete algebraic Ricatti equation [31]. From Lemma 8, with Mi 
Hermitian and positive definite, (121) has a positive definite solution Xi (i.e., Ei y A{Ai) if and only 
if 



radius (m i 'WiMj = radius (*i) < i. 



Similarly, applying the Woodbury matrix identity to invert E 2 in (67) and substituting the result into 
(68), we obtain 



where 



X 2 + W^ 1 W 2 = M 2 , (122) 



X 2 — S 2 — Ar,A 2 . 



Matrix equation (122) has a positive definite solution X 2 (i.e., S 2 y A^A 2 ) if and only if 

radius (m~ ^W 2 M~ = radius (* 2 ) < i. 
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